Differentially Private Simple Linear Regression

نویسندگان

چکیده

Abstract Economics and social science research often require analyzing datasets of sensitive personal information at fine granularity, with models fit to small subsets the data. Unfortunately, such fine-grained analysis can easily reveal individual information. We study regression algorithms that satisfy differential privacy , a constraint which guarantees an algorithm’s output reveals little about any input data record, even attacker side dataset. Motivated by Opportunity Atlas high-profile, small-area tool in economics research, we perform thorough experimental evaluation differentially private for simple linear on tens hundreds records—a particularly challenging regime privacy. In contrast, prior work focused multivariate large or asymptotic analysis. Through range experiments, identify key factors affect relative performance algorithms. find based robust estimators—in particular, median-based estimator Theil Sen—perform best (e.g., datapoints), while Ordinary Least Squares Gradient Descent better datasets. However, also discuss regimes this general finding does not hold. Notably, analogues Theil–Sen (one was suggested theoretical Dwork Lei) have been studied regression.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Differentially Private Local Electricity Markets

Privacy-preserving electricity markets have a key role in steering customers towards participation in local electricity markets by guarantying to protect their sensitive information. Moreover, these markets make it possible to statically release and share the market outputs for social good. This paper aims to design a market for local energy communities by implementing Differential Privacy (DP)...

متن کامل

Simple Linear Regression

One of the most widely used statistical techniques is simple linear regression. This technique is used to relate a measured response variable, Y, to a single measured predictor (explanatory) variable, X, by means of a straight line. It uses the principle of least squares to come up with values of the “best” slope and intercept for a straight line that approximates the relationship. By means of ...

متن کامل

Simple Linear Regression

Rationale Frequently decision-making situations require modeling of relationships among business variables. For instance, the amount of sale of a product may be related to its advertising expenditures for marketing, the health of the economy as measured by the stock market performance, and the number of sales people etc? The regression analysis provides tools for modeling a "response" variable ...

متن کامل

Revisiting differentially private linear regression: optimal and adaptive prediction&estimation in unbounded domain

We revisit the problem of linear regression under a differential privacy constraint. By consolidating existing pieces in the literature, we clarify the correct dependence of the feature, label and coefficient domain in the optimization error and estimation error, hence revealing the delicate price of differential privacy in statistical estimation and statistical learning. Moreover, we propose s...

متن کامل

Correlation and simple linear regression.

In this tutorial article, the concepts of correlation and regression are reviewed and demonstrated. The authors review and compare two correlation coefficients, the Pearson correlation coefficient and the Spearman rho, for measuring linear and nonlinear relationships between two continuous variables. In the case of measuring the linear relationship between a predictor and an outcome variable, s...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings on Privacy Enhancing Technologies

سال: 2022

ISSN: ['2299-0984']

DOI: https://doi.org/10.2478/popets-2022-0041